NSF PAR Search | NSF Public Access Repository


Holistic Evaluation for Interleaved Text-and-Image Generation

https://doi.org/10.18653/v1/2024.emnlp-main.1228

Liu, Minqian; Xu, Zhiyang; Lin, Zihao; Ashby, Trevor; Rimchala, Joy; Zhang, Jiaxin; Huang, Lifu (January 2024, Association for Computational Linguistics)

Interleaved text-and-image generation has been an intriguing research direction, where the models are required to generate both images and text pieces in an arbitrary order. Despite the emerging advancements in interleaved generation, the progress in its evaluation still significantly lags behind. Existing evaluation benchmarks do not support arbitrarily interleaved images and text for both inputs and outputs, and they only cover a limited number of domains and use cases. Also, current works predominantly use similarity-based metrics which fall short in assessing the quality in open-ended scenarios. To this end, we introduce InterleavedBench, the first benchmark carefully curated for the evaluation of interleaved text-and-image generation. InterleavedBench features a rich array of tasks to cover diverse real-world use cases. In addition, we present InterleavedEval, a strong reference-free metric powered by GPT-4o to deliver accurate and explainable evaluation. We carefully define five essential evaluation aspects for InterleavedEval, including text quality, perceptual quality, image coherence, text-image coherence, and helpfulness, to ensure a comprehensive and fine-grained assessment. Through extensive experiments and rigorous human evaluation, we show that our benchmark and metric can effectively evaluate the existing models with a strong correlation with human judgments surpassing previous reference-based metrics. We also provide substantial findings and insights to foster future research in interleaved generation and its evaluation.
Full Text Available
Learning from a Friend: Improving Event Extraction via Self-Training with Feedback from Abstract Meaning Representation

https://doi.org/10.18653/v1/2023.findings-acl.662

Xu, Zhiyang; Lee, Jay Yoon; Huang, Lifu (August 2023, Association for Computational Linguistics)

Data scarcity has been the main factor that hinders the progress of event extraction. To overcome this issue, we propose a Self-Training with Feedback (STF) framework that leverages the large-scale unlabeled data and acquires feedback for each new event prediction from the unlabeled data by comparing it to the Abstract Meaning Representation (AMR) graph of the same sentence. Specifically, STF consists of (1) a base event extraction model trained on existing event annotations and then applied to large-scale unlabeled corpora to predict new event mentions as pseudo training samples, and (2) a novel scoring model that takes in each new predicted event trigger, an argument, its argument role, as well as their paths in the AMR graph to estimate a compatibility score indicating the correctness of the pseudo label. The compatibility scores further act as feedback to encourage or discourage the model learning on the pseudo labels during self-training. Experimental results on three benchmark datasets, including ACE05-E, ACE05-E+, and ERE, demonstrate the effectiveness of the STF framework on event extraction, especially event argument extraction, with significant performance gain over the base event extraction models and strong baselines. Our experimental analysis further shows that STF is a generic framework as it can be applied to improve most, if not all, event extraction models by leveraging large-scale unlabeled data, even when high-quality AMR graph annotations are not available.
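The feedback step described in this abstract can be illustrated with a minimal sketch. It is not the authors' objective: the function name, the `threshold` midpoint, and the signed weighting are all assumptions chosen only to show how a compatibility score could encourage or discourage learning on a pseudo label.

```python
import math


def weighted_pseudo_label_loss(probs, scores, threshold=0.5):
    """Score-weighted negative log-likelihood over pseudo labels.

    probs:     probability the model assigns to each pseudo label (0 < p <= 1).
    scores:    compatibility score in [0, 1] for each pseudo label
               (in the abstract these come from an AMR-based scoring model).
    threshold: hypothetical midpoint, not taken from the paper. Scores above
               it encourage learning the pseudo label, scores below it
               discourage it.
    """
    if len(probs) != len(scores):
        raise ValueError("probs and scores must have the same length")
    if not probs:
        return 0.0
    total = 0.0
    for p, s in zip(probs, scores):
        feedback = s - threshold           # positive: encourage, negative: discourage
        total += feedback * -math.log(p)   # signed, score-weighted NLL term
    return total / len(probs)
```

Under this assumed weighting, minimizing the loss raises the probability of pseudo labels with high compatibility scores and lowers it for those with low scores. A pseudo label scored exactly at the threshold contributes nothing.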
Full Text Available
Hyperspectral Image Super-Resolution in Arbitrary Input-Output Band Settings

Zhang, Zhongyang; Xu, Zhiyang; Ahmed, Zia; Salekin, Asif; Rahman, Tauhidur (January 2022, Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops)

Full Text Available
Hyperspectral Image Super-Resolution in Arbitrary Input-Output Band Settings

https://doi.org/10.1109/WACVW54805.2022.00082

Zhang, Zhongyang; Xu, Zhiyang; Ahmed, Zia; Salekin, Asif; Rahman, Tauhidur (January 2022, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW))

Full Text Available
